Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 6523 |
| Missing cells | 2570 |
| Missing cells (%) | 2.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.1 MiB |
| Average record size in memory | 505.2 B |
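The missing-cell percentage follows from the counts in the table: missing cells divided by the total number of cells (variables × observations). A minimal check, using only the figures reported above:

```python
# Figures copied from the dataset statistics table.
n_variables = 15
n_observations = 6523
missing_cells = 2570

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(total_cells)            # 97845
print(f"{missing_pct:.1f}%")  # 2.6%
```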
Variable types
| NUM | 10 |
|---|---|
| CAT | 5 |
Warnings
| name has a high cardinality: 6351 distinct values | High cardinality |
|---|---|
| host_name has a high cardinality: 1902 distinct values | High cardinality |
| neighbourhood has a high cardinality: 77 distinct values | High cardinality |
| last_review has a high cardinality: 820 distinct values | High cardinality |
| last_review has 1285 (19.7%) missing values | Missing |
| reviews_per_month has 1285 (19.7%) missing values | Missing |
| price is highly skewed (γ1 = 20.25399533) | Skewed |
| name is uniformly distributed | Uniform |
| id has unique values | Unique |
| number_of_reviews has 1285 (19.7%) zeros | Zeros |
| availability_365 has 1797 (27.5%) zeros | Zeros |
Reproduction
| Analysis started | 2021-01-20 02:39:47.185658 |
|---|---|
| Analysis finished | 2021-01-20 02:40:03.252345 |
| Duration | 16.07 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
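The reported duration is simply the difference between the two timestamps. A short check with the standard library, using the values from the table above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2021-01-20 02:39:47.185658")
finished = datetime.fromisoformat("2021-01-20 02:40:03.252345")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")  # 16.07 seconds
```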
id
| Distinct | 6523 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29368180.91 |
|---|---|
| Minimum | 2384 |
| Maximum | 47141177 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 51.1 KiB |
Quantile statistics
| Minimum | 2384 |
|---|---|
| 5-th percentile | 4245243 |
| Q1 | 19478989 |
| median | 31990014 |
| Q3 | 41310540 |
| 95-th percentile | 46331194.7 |
| Maximum | 47141177 |
| Range | 47138793 |
| Interquartile range (IQR) | 21831551 |
Descriptive statistics
| Standard deviation | 13434526.95 |
|---|---|
| Coefficient of variation (CV) | 0.4574517907 |
| Kurtosis | -0.9110283935 |
| Mean | 29368180.91 |
| Median Absolute Deviation (MAD) | 10450481 |
| Skewness | -0.4930470015 |
| Sum | 1.915686441e+11 |
| Variance | 1.804865143e+14 |
| Monotonicity | Strictly increasing |
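Several of the figures above are derived from others in the same tables: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the standard deviation squared. The sketch below recomputes them from the reported (rounded) values, so small differences in the last digits are expected:

```python
# Values copied from the quantile and descriptive statistics tables.
n = 6523
minimum, maximum = 2384, 47141177
q1, q3 = 19478989, 41310540
mean, std = 29368180.91, 13434526.95

value_range = maximum - minimum   # spread between the extremes
iqr = q3 - q1                     # spread of the middle 50%
cv = std / mean                   # coefficient of variation (unitless)
variance = std ** 2               # squared standard deviation
total = mean * n                  # sum of all values

print(value_range)       # 47138793
print(iqr)               # 21831551
print(f"{cv:.4f}")       # 0.4575
```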
Histogram with fixed size bins (bins=50)
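A fixed-size-bin histogram splits the interval between the minimum and maximum into equally wide bins and counts the values falling into each. The function below is an illustrative sketch of that idea in plain Python; `fixed_width_histogram` is a hypothetical helper, not the routine the profiling tool uses internally.

```python
def fixed_width_histogram(values, bins=50):
    """Return (edges, counts) for equal-width bins spanning min..max."""
    lo, hi = min(values), max(values)
    if lo == hi:
        # Degenerate case: every value is identical, so use a single bin.
        return [lo, hi], [len(values)]
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum lies exactly on the last edge; keep it in the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return edges, counts


edges, counts = fixed_width_histogram(list(range(11)), bins=5)
print(counts)  # [2, 2, 2, 2, 3]
```

With the minimum and maximum reported for this column, 50 bins would each span (47141177 - 2384) / 50 ≈ 942,776.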
| Value | Count | Frequency (%) | |
| 43421694 | 1 | < 0.1% | |
| 45669817 | 1 | < 0.1% | |
| 15730074 | 1 | < 0.1% | |
| 9610654 | 1 | < 0.1% | |
| 7408812 | 1 | < 0.1% | |
| 20379044 | 1 | < 0.1% | |
| 45508006 | 1 | < 0.1% | |
| 47054253 | 1 | < 0.1% | |
| 31225961 | 1 | < 0.1% | |
| 41620912 | 1 | < 0.1% | |
| 6086065 | 1 | < 0.1% | |
| 39964083 | 1 | < 0.1% | |
| 45237690 | 1 | < 0.1% | |
| 13972956 | 1 | < 0.1% | |
| 42720515 | 1 | < 0.1% | |
| 37371325 | 1 | < 0.1% | |
| 42272194 | 1 | < 0.1% | |
| 20605570 | 1 | < 0.1% | |
| 43834825 | 1 | < 0.1% | |
| 22449611 | 1 | < 0.1% | |
| 13532621 | 1 | < 0.1% | |
| 25343444 | 1 | < 0.1% | |
| 7370245 | 1 | < 0.1% | |
| 26959321 | 1 | < 0.1% | |
| 37707161 | 1 | < 0.1% | |
| Other values (6498) | 6498 | 99.6% |
| Value | Count | Frequency (%) | |
| 2384 | 1 | < 0.1% | |
| 4505 | 1 | < 0.1% | |
| 7126 | 1 | < 0.1% | |
| 9811 | 1 | < 0.1% | |
| 10610 | 1 | < 0.1% | |
| 10945 | 1 | < 0.1% | |
| 12140 | 1 | < 0.1% | |
| 22362 | 1 | < 0.1% | |
| 24833 | 1 | < 0.1% | |
| 25879 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 47141177 | 1 | < 0.1% | |
| 47140245 | 1 | < 0.1% | |
| 47137445 | 1 | < 0.1% | |
| 47126361 | 1 | < 0.1% | |
| 47126307 | 1 | < 0.1% | |
| 47123944 | 1 | < 0.1% | |
| 47121422 | 1 | < 0.1% | |
| 47118155 | 1 | < 0.1% | |
| 47116894 | 1 | < 0.1% | |
| 47115140 | 1 | < 0.1% |
name
| Distinct | 6351 |
|---|---|
| Distinct (%) | 97.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.1 KiB |
| Live + Work + Stay + Easy \| 1BR in Chicago | 18 |
|---|---|
| UChicago, Shops + Eats, Lake \| Gym + W&D \| Zencity | 14 |
| Traveler's Dream - 1 bed in a shared bedroom | 10 |
| UChicago, Lake, Sci Museum \| Gym + W&D \| Zencity | 9 |
| Hotel Perks - Private Bedroom \| Private Bathroom | 8 |
| Other values (6346) |
| Value | Count | Frequency (%) | |
| Live + Work + Stay + Easy \| 1BR in Chicago | 18 | 0.3% | |
| UChicago, Shops + Eats, Lake \| Gym + W&D \| Zencity | 14 | 0.2% | |
| Traveler's Dream - 1 bed in a shared bedroom | 10 | 0.2% | |
| UChicago, Lake, Sci Museum \| Gym + W&D \| Zencity | 9 | 0.1% | |
| Hotel Perks - Private Bedroom \| Private Bathroom | 8 | 0.1% | |
| Live + Work + Stay + Easy \| 3BR in Chicago | 7 | 0.1% | |
| Stylish & Ample 1PBR STAYCATION SPACE For You | 6 | 0.1% | |
| Steps to MI Ave Shops \| View, Beach, Gym \| Zencity | 6 | 0.1% | |
| Enjoy the Lakefront from a Cozy Retreat | 5 | 0.1% | |
| Steps to Shop, Eat, Train \| Easy Access \| Zencity | 4 | 0.1% | |
| Live + Work + Stay + Easy \| 2BR in Chicago | 4 | 0.1% | |
| A home you will love \| 2BR in Chicago | 4 | 0.1% | |
| Steps to UChicago \| Easy Access + W&D \| Zencity | 4 | 0.1% | |
| Entire apartment for you \| 2BR in Chicago | 3 | < 0.1% | |
| Steps to Shops, Eats \| Easy Access + W&D \| Zencity | 3 | < 0.1% | |
| BEST LOCATION EVER!WRIGLEY - BOYSTOWN - LAKEVIEW 1 | 3 | < 0.1% | |
| SUPER EARLY CHECK IN AND SUPER LATE CHECK OUT | 3 | < 0.1% | |
| XL Penthouse"The Harper"Book 6 Nights Get 1 FREE | 3 | < 0.1% | |
| LAKEVIEW HEART! BOYSTOWN -WRIGLEY "HOSTEL STYLE" 2 | 3 | < 0.1% | |
| All-inclusive apartment home \| 3BR in Chicago | 3 | < 0.1% | |
| Gritty Chic River North + ACME Hotel | 3 | < 0.1% | |
| Classic HP 1BR with Fast Transit to UChicago & DT by Zen Rentals | 3 | < 0.1% | |
| Professionally maintained apt \| 2BR in Chicago | 3 | < 0.1% | |
| 5min to Wicker & DT \| Lux Flat + W&D \| Zencity | 3 | < 0.1% | |
| Bright Loop 1BR w/ Gym, Pool, nr. Financial District, by Blueground | 3 | < 0.1% | |
| Other values (6326) | 6388 | 97.9% |
Frequencies of value counts
Unique
| Unique | 6264 |
|---|---|
| Unique (%) | 96.0% |
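The "Distinct" and "Unique" counts differ because they appear to measure different things: distinct is the number of different values in the column, while unique is the number of values that occur exactly once. A small sketch on a made-up list (the sample names are hypothetical, not rows from the dataset):

```python
from collections import Counter

# Hypothetical sample; not real rows from the dataset.
names = ["Loft A", "Loft A", "Studio B", "Room C", "Room C", "Room C", "Flat D"]

counts = Counter(names)
distinct = len(counts)                               # different values
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once

print(distinct)  # 4
print(unique)    # 2
```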
Histogram of lengths of the category
Length
| Max length | 206 |
|---|---|
| Median length | 45 |
| Mean length | 41.40870765 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| (space) | 38673 | 14.3% | |
| e | 19259 | 7.1% | |
| o | 18576 | 6.9% | |
| a | 14204 | 5.3% | |
| i | 13932 | 5.2% | |
| t | 13461 | 5.0% | |
| n | 13294 | 4.9% | |
| r | 13191 | 4.9% | |
| l | 7283 | 2.7% | |
| s | 6152 | 2.3% | |
| u | 5797 | 2.1% | |
| d | 5730 | 2.1% | |
| m | 5086 | 1.9% | |
| c | 5044 | 1.9% | |
| h | 4822 | 1.8% | |
| g | 4436 | 1.6% | |
| y | 4112 | 1.5% | |
| B | 3864 | 1.4% | |
| C | 3697 | 1.4% | |
| p | 3496 | 1.3% | |
| w | 3372 | 1.2% | |
| S | 3327 | 1.2% | |
| R | 3156 | 1.2% | |
| L | 3143 | 1.2% | |
| P | 2966 | 1.1% | |
| Other values (314) | 50036 | 18.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 172960 | 64.0% | |
| Uppercase Letter | 43463 | 16.1% | |
| Space Separator | 38727 | 14.3% | |
| Other Punctuation | 6603 | 2.4% | |
| Decimal Number | 4770 | 1.8% | |
| Dash Punctuation | 1288 | 0.5% | |
| Math Symbol | 999 | 0.4% | |
| Other Symbol | 284 | 0.1% | |
| Other Letter | 271 | 0.1% | |
| Open Punctuation | 262 | 0.1% | |
| Close Punctuation | 254 | 0.1% | |
| Nonspacing Mark | 101 | < 0.1% | |
| Final Punctuation | 60 | < 0.1% | |
| Currency Symbol | 26 | < 0.1% | |
| Control | 16 | < 0.1% | |
| Initial Punctuation | 11 | < 0.1% | |
| Spacing Mark | 9 | < 0.1% | |
| Format | 3 | < 0.1% | |
| Modifier Symbol | 2 | < 0.1% |
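The category labels in this table match the names of Unicode general categories (for example "Space Separator" is `Zs` and "Math Symbol" is `Sm`). As a rough illustration, and assuming the report groups characters this way, Python's standard `unicodedata` module can produce the same kind of breakdown. The sample title is hypothetical:

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by their two-letter Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "Gym + W&D | 1BR"  # hypothetical listing title
counts = category_counts(sample)

print(counts["Sm"])  # 2  ('+' and '|' are Math Symbols)
print(counts["Zs"])  # 4  (the four spaces)
print(counts["Po"])  # 1  ('&' is Other Punctuation)
```

This also explains why the vertical bar and the plus sign, both common in listing titles, show up under "Math Symbol" rather than under punctuation.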
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| B | 3864 | 8.9% | |
| C | 3697 | 8.5% | |
| S | 3327 | 7.7% | |
| R | 3156 | 7.3% | |
| L | 3143 | 7.2% | |
| P | 2966 | 6.8% | |
| A | 2533 | 5.8% | |
| E | 1982 | 4.6% | |
| T | 1907 | 4.4% | |
| H | 1784 | 4.1% | |
| N | 1686 | 3.9% | |
| O | 1656 | 3.8% | |
| W | 1628 | 3.7% | |
| M | 1621 | 3.7% | |
| D | 1563 | 3.6% | |
| G | 1432 | 3.3% | |
| I | 1252 | 2.9% | |
| F | 1187 | 2.7% | |
| U | 968 | 2.2% | |
| V | 695 | 1.6% | |
| K | 406 | 0.9% | |
| Y | 383 | 0.9% | |
| Q | 244 | 0.6% | |
| Z | 117 | 0.3% | |
| X | 117 | 0.3% | |
| Other values (21) | 149 | 0.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 19259 | 11.1% | |
| o | 18576 | 10.7% | |
| a | 14204 | 8.2% | |
| i | 13932 | 8.1% | |
| t | 13461 | 7.8% | |
| n | 13294 | 7.7% | |
| r | 13191 | 7.6% | |
| l | 7283 | 4.2% | |
| s | 6152 | 3.6% | |
| u | 5797 | 3.4% | |
| d | 5730 | 3.3% | |
| m | 5086 | 2.9% | |
| c | 5044 | 2.9% | |
| h | 4822 | 2.8% | |
| g | 4436 | 2.6% | |
| y | 4112 | 2.4% | |
| p | 3496 | 2.0% | |
| w | 3372 | 1.9% | |
| k | 2856 | 1.7% | |
| f | 2452 | 1.4% | |
| v | 2270 | 1.3% | |
| b | 2074 | 1.2% | |
| z | 744 | 0.4% | |
| x | 624 | 0.4% | |
| q | 488 | 0.3% | |
| Other values (41) | 205 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 38673 | 99.9% | |
| (other space character) | 53 | 0.1% | |
| (other space character) | 1 | < 0.1% | |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 1267 | 98.4% | |
| — | 11 | 0.9% | |
| – | 10 | 0.8% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 1800 | 27.3% | |
| / | 1530 | 23.2% | |
| ! | 1010 | 15.3% | |
| & | 731 | 11.1% | |
| . | 675 | 10.2% | |
| ' | 203 | 3.1% | |
| # | 159 | 2.4% | |
| * | 151 | 2.3% | |
| : | 102 | 1.5% | |
| • | 69 | 1.0% | |
| " | 60 | 0.9% | |
| @ | 31 | 0.5% | |
| ; | 27 | 0.4% | |
| ? | 11 | 0.2% | |
| ! | 10 | 0.2% | |
| , | 9 | 0.1% | |
| % | 6 | 0.1% | |
| 。 | 6 | 0.1% | |
| % | 5 | 0.1% | |
| 、 | 4 | 0.1% | |
| / | 2 | < 0.1% | |
| \ | 1 | < 0.1% | |
| @ | 1 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 1549 | 32.5% | |
| 1 | 1184 | 24.8% | |
| 3 | 692 | 14.5% | |
| 0 | 347 | 7.3% | |
| 4 | 329 | 6.9% | |
| 5 | 277 | 5.8% | |
| 6 | 133 | 2.8% | |
| 9 | 98 | 2.1% | |
| 8 | 73 | 1.5% | |
| 7 | 69 | 1.4% | |
| 4 | 5 | 0.1% | |
| 0 | 5 | 0.1% | |
| 2 | 5 | 0.1% | |
| 1 | 3 | 0.1% | |
| 3 | 1 | < 0.1% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 244 | 93.1% | |
| [ | 7 | 2.7% | |
| 【 | 6 | 2.3% | |
| ( | 3 | 1.1% | |
| 《 | 2 | 0.8% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 241 | 94.9% | |
| 】 | 6 | 2.4% | |
| ) | 4 | 1.6% | |
| 》 | 2 | 0.8% | |
| ] | 1 | 0.4% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| \| | 528 | 52.9% | |
| + | 450 | 45.0% | |
| ~ | 13 | 1.3% | |
| \| | 3 | 0.3% | |
| ⟣ | 1 | 0.1% | |
| ⟢ | 1 | 0.1% | |
| = | 1 | 0.1% | |
| < | 1 | 0.1% | |
| > | 1 | 0.1% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| ★ | 52 | 18.3% | |
| ❤ | 46 | 16.2% | |
| ⭐ | 33 | 11.6% | |
| │ | 15 | 5.3% | |
| ♥ | 10 | 3.5% | |
| 🏠 | 8 | 2.8% | |
| ⚡ | 7 | 2.5% | |
| ✔ | 7 | 2.5% | |
| ▌ | 6 | 2.1% | |
| ◆ | 6 | 2.1% | |
| 💎 | 5 | 1.8% | |
| ⚜ | 5 | 1.8% | |
| ➟ | 5 | 1.8% | |
| ✪ | 4 | 1.4% | |
| ♫ | 4 | 1.4% | |
| ♬ | 4 | 1.4% | |
| ✭ | 4 | 1.4% | |
| ♔ | 4 | 1.4% | |
| ✦ | 4 | 1.4% | |
| ✨ | 4 | 1.4% | |
| ☕ | 3 | 1.1% | |
| 🥇 | 3 | 1.1% | |
| 🌸 | 3 | 1.1% | |
| 💗 | 2 | 0.7% | |
| ✯ | 2 | 0.7% | |
| Other values (28) | 38 | 13.4% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| (control character) | 16 | 100.0% | |
Most frequent Nonspacing Mark characters
| Value | Count | Frequency (%) | |
| ️ | 81 | 80.2% | |
| ् | 9 | 8.9% | |
| े | 9 | 8.9% | |
| ً | 2 | 2.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 2 | 100.0% |
Most frequent Initial Punctuation characters
| Value | Count | Frequency (%) | |
| “ | 9 | 81.8% | |
| « | 1 | 9.1% | |
| ‘ | 1 | 9.1% |
Most frequent Final Punctuation characters
| Value | Count | Frequency (%) | |
| ’ | 51 | 85.0% | |
| ” | 8 | 13.3% | |
| » | 1 | 1.7% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 26 | 100.0% |
Most frequent Other Letter characters
| Value | Count | Frequency (%) | |
| ه | 11 | 4.1% | |
| ل | 11 | 4.1% | |
| ا | 11 | 4.1% | |
| 中 | 11 | 4.1% | |
| 迎 | 10 | 3.7% | |
| أ | 10 | 3.7% | |
| 唐 | 10 | 3.7% | |
| 歡 | 9 | 3.3% | |
| ب | 9 | 3.3% | |
| ك | 9 | 3.3% | |
| स | 9 | 3.3% | |
| व | 9 | 3.3% | |
| ग | 9 | 3.3% | |
| त | 9 | 3.3% | |
| ह | 9 | 3.3% | |
| 房 | 5 | 1.8% | |
| 人 | 4 | 1.5% | |
| 街 | 4 | 1.5% | |
| 床 | 4 | 1.5% | |
| 城 | 3 | 1.1% | |
| 近 | 3 | 1.1% | |
| 的 | 3 | 1.1% | |
| 和 | 2 | 0.7% | |
| 车 | 2 | 0.7% | |
| 个 | 2 | 0.7% | |
| Other values (71) | 93 | 34.3% |
Most frequent Spacing Mark characters
| Value | Count | Frequency (%) | |
| ा | 9 | 100.0% |
Most frequent Format characters
| Value | Count | Frequency (%) | |
| | 3 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 216385 | 80.1% | |
| Common | 53343 | 19.7% | |
| Han | 153 | 0.1% | |
| Inherited | 83 | < 0.1% | |
| Devanagari | 72 | < 0.1% | |
| Arabic | 63 | < 0.1% | |
| Hangul | 10 | < 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 19259 | 8.9% | |
| o | 18576 | 8.6% | |
| a | 14204 | 6.6% | |
| i | 13932 | 6.4% | |
| t | 13461 | 6.2% | |
| n | 13294 | 6.1% | |
| r | 13191 | 6.1% | |
| l | 7283 | 3.4% | |
| s | 6152 | 2.8% | |
| u | 5797 | 2.7% | |
| d | 5730 | 2.6% | |
| m | 5086 | 2.4% | |
| c | 5044 | 2.3% | |
| h | 4822 | 2.2% | |
| g | 4436 | 2.1% | |
| y | 4112 | 1.9% | |
| B | 3864 | 1.8% | |
| C | 3697 | 1.7% | |
| p | 3496 | 1.6% | |
| w | 3372 | 1.6% | |
| S | 3327 | 1.5% | |
| R | 3156 | 1.5% | |
| L | 3143 | 1.5% | |
| P | 2966 | 1.4% | |
| k | 2856 | 1.3% | |
| Other values (63) | 32129 | 14.8% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 38673 | 72.5% | |
| , | 1800 | 3.4% | |
| 2 | 1549 | 2.9% | |
| / | 1530 | 2.9% | |
| - | 1267 | 2.4% | |
| 1 | 1184 | 2.2% | |
| ! | 1010 | 1.9% | |
| & | 731 | 1.4% | |
| 3 | 692 | 1.3% | |
| . | 675 | 1.3% | |
| \| | 528 | 1.0% | |
| + | 450 | 0.8% | |
| 0 | 347 | 0.7% | |
| 4 | 329 | 0.6% | |
| 5 | 277 | 0.5% | |
| ( | 244 | 0.5% | |
| ) | 241 | 0.5% | |
| ' | 203 | 0.4% | |
| # | 159 | 0.3% | |
| * | 151 | 0.3% | |
| 6 | 133 | 0.2% | |
| : | 102 | 0.2% | |
| 9 | 98 | 0.2% | |
| 8 | 73 | 0.1% | |
| 7 | 69 | 0.1% | |
| Other values (125) | 828 | 1.6% |
Most frequent Inherited characters
| Value | Count | Frequency (%) | |
| ️ | 81 | 97.6% | |
| ً | 2 | 2.4% |
Most frequent Han characters
| Value | Count | Frequency (%) | |
| 中 | 11 | 7.2% | |
| 迎 | 10 | 6.5% | |
| 唐 | 10 | 6.5% | |
| 歡 | 9 | 5.9% | |
| 房 | 5 | 3.3% | |
| 人 | 4 | 2.6% | |
| 街 | 4 | 2.6% | |
| 床 | 4 | 2.6% | |
| 城 | 3 | 2.0% | |
| 近 | 3 | 2.0% | |
| 的 | 3 | 2.0% | |
| 和 | 2 | 1.3% | |
| 车 | 2 | 1.3% | |
| 个 | 2 | 1.3% | |
| 室 | 2 | 1.3% | |
| 文 | 2 | 1.3% | |
| 东 | 2 | 1.3% | |
| 风 | 2 | 1.3% | |
| 新 | 2 | 1.3% | |
| 超 | 2 | 1.3% | |
| 市 | 2 | 1.3% | |
| 酒 | 2 | 1.3% | |
| 吧 | 2 | 1.3% | |
| 歺 | 2 | 1.3% | |
| 芝 | 2 | 1.3% | |
| Other values (53) | 59 | 38.6% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ه | 11 | 17.5% | |
| ل | 11 | 17.5% | |
| ا | 11 | 17.5% | |
| أ | 10 | 15.9% | |
| ب | 9 | 14.3% | |
| ك | 9 | 14.3% | |
| و | 1 | 1.6% | |
| س | 1 | 1.6% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| स | 9 | 12.5% | |
| ् | 9 | 12.5% | |
| व | 9 | 12.5% | |
| ा | 9 | 12.5% | |
| ग | 9 | 12.5% | |
| त | 9 | 12.5% | |
| ह | 9 | 12.5% | |
| े | 9 | 12.5% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 시 | 2 | 20.0% | |
| 카 | 2 | 20.0% | |
| 고 | 2 | 20.0% | |
| 민 | 2 | 20.0% | |
| 박 | 2 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 268905 | 99.6% | |
| None | 432 | 0.2% | |
| Punctuation | 159 | 0.1% | |
| CJK | 153 | 0.1% | |
| Misc Symbols | 97 | < 0.1% | |
| Dingbats | 85 | < 0.1% | |
| VS | 81 | < 0.1% | |
| Devanagari | 72 | < 0.1% | |
| Arabic | 65 | < 0.1% | |
| Math Alphanum | 38 | < 0.1% | |
| Hangul | 10 | < 0.1% | |
| Block Elements | 6 | < 0.1% | |
| Geometric Shapes | 6 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| (space) | 38673 | 14.4% | |
| e | 19259 | 7.2% | |
| o | 18576 | 6.9% | |
| a | 14204 | 5.3% | |
| i | 13932 | 5.2% | |
| t | 13461 | 5.0% | |
| n | 13294 | 4.9% | |
| r | 13191 | 4.9% | |
| l | 7283 | 2.7% | |
| s | 6152 | 2.3% | |
| u | 5797 | 2.2% | |
| d | 5730 | 2.1% | |
| m | 5086 | 1.9% | |
| c | 5044 | 1.9% | |
| h | 4822 | 1.8% | |
| g | 4436 | 1.6% | |
| y | 4112 | 1.5% | |
| B | 3864 | 1.4% | |
| C | 3697 | 1.4% | |
| p | 3496 | 1.3% | |
| w | 3372 | 1.3% | |
| S | 3327 | 1.2% | |
| R | 3156 | 1.2% | |
| L | 3143 | 1.2% | |
| P | 2966 | 1.1% | |
| Other values (67) | 48832 | 18.2% |
Most frequent Misc Symbols characters
| Value | Count | Frequency (%) | |
| ★ | 52 | 53.6% | |
| ♥ | 10 | 10.3% | |
| ⚡ | 7 | 7.2% | |
| ⚜ | 5 | 5.2% | |
| ♫ | 4 | 4.1% | |
| ♬ | 4 | 4.1% | |
| ♔ | 4 | 4.1% | |
| ☕ | 3 | 3.1% | |
| ♕ | 2 | 2.1% | |
| ☺ | 1 | 1.0% | |
| ☙ | 1 | 1.0% | |
| ☾ | 1 | 1.0% | |
| ☽ | 1 | 1.0% | |
| ♡ | 1 | 1.0% | |
| ⚽ | 1 | 1.0% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| (other space character) | 53 | 12.3% | |
| ⭐ | 33 | 7.6% | |
| C | 19 | 4.4% | |
| │ | 15 | 3.5% | |
| e | 14 | 3.2% | |
| n | 13 | 3.0% | |
| f | 12 | 2.8% | |
| i | 11 | 2.5% | |
| o | 11 | 2.5% | |
| ó | 10 | 2.3% | |
| ! | 10 | 2.3% | |
| D | 10 | 2.3% | |
| , | 9 | 2.1% | |
| t | 9 | 2.1% | |
| 🏠 | 8 | 1.9% | |
| a | 7 | 1.6% | |
| 【 | 6 | 1.4% | |
| 】 | 6 | 1.4% | |
| 。 | 6 | 1.4% | |
| T | 6 | 1.4% | |
| c | 6 | 1.4% | |
| B | 6 | 1.4% | |
| 💎 | 5 | 1.2% | |
| 4 | 5 | 1.2% | |
| 0 | 5 | 1.2% | |
| Other values (60) | 137 | 31.7% |
Most frequent Dingbats characters
| Value | Count | Frequency (%) | |
| ❤ | 46 | 54.1% | |
| ✔ | 7 | 8.2% | |
| ➟ | 5 | 5.9% | |
| ✪ | 4 | 4.7% | |
| ✭ | 4 | 4.7% | |
| ✦ | 4 | 4.7% | |
| ✨ | 4 | 4.7% | |
| ✯ | 2 | 2.4% | |
| ✈ | 2 | 2.4% | |
| ✵ | 2 | 2.4% | |
| ✿ | 2 | 2.4% | |
| ✱ | 2 | 2.4% | |
| ❧ | 1 | 1.2% |
Most frequent VS characters
| Value | Count | Frequency (%) | |
| ️ | 81 | 100.0% |
Most frequent Punctuation characters
| Value | Count | Frequency (%) | |
| • | 69 | 43.4% | |
| ’ | 51 | 32.1% | |
| — | 11 | 6.9% | |
| – | 10 | 6.3% | |
| “ | 9 | 5.7% | |
| ” | 8 | 5.0% | |
| ‘ | 1 | 0.6% |
Most frequent CJK characters
| Value | Count | Frequency (%) | |
| 中 | 11 | 7.2% | |
| 迎 | 10 | 6.5% | |
| 唐 | 10 | 6.5% | |
| 歡 | 9 | 5.9% | |
| 房 | 5 | 3.3% | |
| 人 | 4 | 2.6% | |
| 街 | 4 | 2.6% | |
| 床 | 4 | 2.6% | |
| 城 | 3 | 2.0% | |
| 近 | 3 | 2.0% | |
| 的 | 3 | 2.0% | |
| 和 | 2 | 1.3% | |
| 车 | 2 | 1.3% | |
| 个 | 2 | 1.3% | |
| 室 | 2 | 1.3% | |
| 文 | 2 | 1.3% | |
| 东 | 2 | 1.3% | |
| 风 | 2 | 1.3% | |
| 新 | 2 | 1.3% | |
| 超 | 2 | 1.3% | |
| 市 | 2 | 1.3% | |
| 酒 | 2 | 1.3% | |
| 吧 | 2 | 1.3% | |
| 歺 | 2 | 1.3% | |
| 芝 | 2 | 1.3% | |
| Other values (53) | 59 | 38.6% |
Most frequent Arabic characters
| Value | Count | Frequency (%) | |
| ه | 11 | 16.9% | |
| ل | 11 | 16.9% | |
| ا | 11 | 16.9% | |
| أ | 10 | 15.4% | |
| ب | 9 | 13.8% | |
| ك | 9 | 13.8% | |
| ً | 2 | 3.1% | |
| و | 1 | 1.5% | |
| س | 1 | 1.5% |
Most frequent Devanagari characters
| Value | Count | Frequency (%) | |
| स | 9 | 12.5% | |
| ् | 9 | 12.5% | |
| व | 9 | 12.5% | |
| ा | 9 | 12.5% | |
| ग | 9 | 12.5% | |
| त | 9 | 12.5% | |
| ह | 9 | 12.5% | |
| े | 9 | 12.5% |
Most frequent Block Elements characters
| Value | Count | Frequency (%) | |
| ▌ | 6 | 100.0% |
Most frequent Hangul characters
| Value | Count | Frequency (%) | |
| 시 | 2 | 20.0% | |
| 카 | 2 | 20.0% | |
| 고 | 2 | 20.0% | |
| 민 | 2 | 20.0% | |
| 박 | 2 | 20.0% |
Most frequent Math Alphanum characters
| Value | Count | Frequency (%) | |
| 𝗋 | 3 | 7.9% | |
| 𝖺 | 3 | 7.9% | |
| 𝗂 | 3 | 7.9% | |
| 𝖻 | 2 | 5.3% | |
| 𝗇 | 2 | 5.3% | |
| 𝗍 | 2 | 5.3% | |
| 𝗎 | 2 | 5.3% | |
| 𝖽 | 2 | 5.3% | |
| 𝗈 | 2 | 5.3% | |
| 𝗄 | 2 | 5.3% | |
| 𝖤 | 2 | 5.3% | |
| 𝖴 | 1 | 2.6% | |
| 𝖢 | 1 | 2.6% | |
| 𝗁 | 1 | 2.6% | |
| 𝖼 | 1 | 2.6% | |
| 𝖲 | 1 | 2.6% | |
| 𝖧 | 1 | 2.6% | |
| 𝗆 | 1 | 2.6% | |
| 𝗅 | 1 | 2.6% | |
| 𝖯 | 1 | 2.6% | |
| 𝖥 | 1 | 2.6% | |
| 𝖱 | 1 | 2.6% | |
| 𝗉 | 1 | 2.6% | |
| 𝗀 | 1 | 2.6% |
Most frequent Geometric Shapes characters
| Value | Count | Frequency (%) | |
| ◆ | 6 | 100.0% |
host_id
Real number (ℝ≥0)
| Distinct | 3553 |
|---|---|
| Distinct (%) | 54.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 106119284.1 |
|---|---|
| Minimum | 2140 |
| Maximum | 380761555 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 51.1 KiB |
Quantile statistics
| Minimum | 2140 |
|---|---|
| 5-th percentile | 1523290 |
| Q1 | 18607228.5 |
| median | 63226994 |
| Q3 | 170785489 |
| 95-th percentile | 337787048.2 |
| Maximum | 380761555 |
| Range | 380759415 |
| Interquartile range (IQR) | 152178260.5 |
Descriptive statistics
| Standard deviation | 106308403.9 |
|---|---|
| Coefficient of variation (CV) | 1.001782144 |
| Kurtosis | -0.1264015088 |
| Mean | 106119284.1 |
| Median Absolute Deviation (MAD) | 54894837 |
| Skewness | 0.9918141307 |
| Sum | 6.922160899e+11 |
| Variance | 1.130147675e+16 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 107434423 | 216 | 3.3% | |
| 3965428 | 74 | 1.1% | |
| 47172572 | 63 | 1.0% | |
| 12243051 | 50 | 0.8% | |
| 359234447 | 49 | 0.8% | |
| 170785489 | 47 | 0.7% | |
| 8534462 | 40 | 0.6% | |
| 9094538 | 40 | 0.6% | |
| 166918192 | 35 | 0.5% | |
| 49626033 | 31 | 0.5% | |
| 63313003 | 30 | 0.5% | |
| 148973907 | 30 | 0.5% | |
| 100782278 | 29 | 0.4% | |
| 33127842 | 26 | 0.4% | |
| 217094024 | 25 | 0.4% | |
| 57387860 | 25 | 0.4% | |
| 371036931 | 24 | 0.4% | |
| 98193524 | 20 | 0.3% | |
| 244000490 | 20 | 0.3% | |
| 683529 | 20 | 0.3% | |
| 178710732 | 19 | 0.3% | |
| 257464365 | 19 | 0.3% | |
| 35781467 | 18 | 0.3% | |
| 2907254 | 18 | 0.3% | |
| 154630260 | 18 | 0.3% | |
| Other values (3528) | 5537 | 84.9% |
| Value | Count | Frequency (%) | |
| 2140 | 2 | < 0.1% | |
| 2153 | 6 | 0.1% | |
| 2613 | 1 | < 0.1% | |
| 4434 | 5 | 0.1% | |
| 5775 | 1 | < 0.1% | |
| 6162 | 1 | < 0.1% | |
| 9301 | 1 | < 0.1% | |
| 11278 | 1 | < 0.1% | |
| 13014 | 1 | < 0.1% | |
| 17928 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 380761555 | 1 | < 0.1% | |
| 380437001 | 1 | < 0.1% | |
| 380353393 | 1 | < 0.1% | |
| 379312368 | 2 | < 0.1% | |
| 379039129 | 1 | < 0.1% | |
| 378666753 | 1 | < 0.1% | |
| 378245394 | 1 | < 0.1% | |
| 378142375 | 1 | < 0.1% | |
| 377995844 | 2 | < 0.1% | |
| 377980684 | 1 | < 0.1% |
host_name
| Distinct | 1902 |
|---|---|
| Distinct (%) | 29.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.1 KiB |
| Blueground | 216 |
|---|---|
| Rob | 82 |
| Zencity | 63 |
| Joe | 63 |
| John | 61 |
| Other values (1897) |
| Value | Count | Frequency (%) | |
| Blueground | 216 | 3.3% | |
| Rob | 82 | 1.3% | |
| Zencity | 63 | 1.0% | |
| Joe | 63 | 1.0% | |
| John | 61 | 0.9% | |
| Michael | 60 | 0.9% | |
| Nicole | 58 | 0.9% | |
| Kia | 53 | 0.8% | |
| Sonder | 50 | 0.8% | |
| David | 47 | 0.7% | |
| Dmd | 47 | 0.7% | |
| Alex | 44 | 0.7% | |
| Helen | 41 | 0.6% | |
| Barsala | 40 | 0.6% | |
| Brad & Sara | 35 | 0.5% | |
| William | 34 | 0.5% | |
| Dan | 34 | 0.5% | |
| Matthew | 33 | 0.5% | |
| Roma | 31 | 0.5% | |
| Mia & Noah | 30 | 0.5% | |
| K | 30 | 0.5% | |
| Matt | 29 | 0.4% | |
| Emily & Rich | 29 | 0.4% | |
| Kari | 28 | 0.4% | |
| Kevin | 28 | 0.4% | |
| Other values (1877) | 5257 | 80.6% |
Frequencies of value counts
Unique
| Unique | 1071 |
|---|---|
| Unique (%) | 16.4% |
Histogram of lengths of the category
Length
| Max length | 35 |
|---|---|
| Median length | 6 |
| Mean length | 6.478460831 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 4727 | 11.2% | |
| e | 3883 | 9.2% | |
| n | 3444 | 8.1% | |
| i | 2987 | 7.1% | |
| r | 2493 | 5.9% | |
| o | 2273 | 5.4% | |
| l | 2148 | 5.1% | |
| t | 1541 | 3.6% | |
| (space) | 1316 | 3.1% | |
| d | 1211 | 2.9% | |
| h | 1204 | 2.8% | |
| s | 1126 | 2.7% | |
| u | 1031 | 2.4% | |
| y | 1018 | 2.4% | |
| c | 853 | 2.0% | |
| A | 802 | 1.9% | |
| J | 788 | 1.9% | |
| m | 700 | 1.7% | |
| M | 618 | 1.5% | |
| S | 554 | 1.3% | |
| g | 545 | 1.3% | |
| B | 543 | 1.3% | |
| R | 520 | 1.2% | |
| K | 442 | 1.0% | |
| D | 439 | 1.0% | |
| Other values (54) | 5053 | 12.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 32872 | 77.8% | |
| Uppercase Letter | 7632 | 18.1% | |
| Space Separator | 1316 | 3.1% | |
| Other Punctuation | 365 | 0.9% | |
| Math Symbol | 39 | 0.1% | |
| Open Punctuation | 10 | < 0.1% | |
| Close Punctuation | 10 | < 0.1% | |
| Dash Punctuation | 9 | < 0.1% | |
| Decimal Number | 3 | < 0.1% | |
| Other Letter | 2 | < 0.1% | |
| Modifier Symbol | 1 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 802 | 10.5% | |
| J | 788 | 10.3% | |
| M | 618 | 8.1% | |
| S | 554 | 7.3% | |
| B | 543 | 7.1% | |
| R | 520 | 6.8% | |
| K | 442 | 5.8% | |
| D | 439 | 5.8% | |
| C | 434 | 5.7% | |
| T | 298 | 3.9% | |
| L | 291 | 3.8% | |
| N | 274 | 3.6% | |
| E | 272 | 3.6% | |
| H | 231 | 3.0% | |
| P | 193 | 2.5% | |
| G | 182 | 2.4% | |
| F | 159 | 2.1% | |
| W | 138 | 1.8% | |
| Z | 110 | 1.4% | |
| V | 107 | 1.4% | |
| I | 96 | 1.3% | |
| O | 70 | 0.9% | |
| Y | 48 | 0.6% | |
| X | 10 | 0.1% | |
| U | 7 | 0.1% | |
| Other values (2) | 6 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 4727 | 14.4% | |
| e | 3883 | 11.8% | |
| n | 3444 | 10.5% | |
| i | 2987 | 9.1% | |
| r | 2493 | 7.6% | |
| o | 2273 | 6.9% | |
| l | 2148 | 6.5% | |
| t | 1541 | 4.7% | |
| d | 1211 | 3.7% | |
| h | 1204 | 3.7% | |
| s | 1126 | 3.4% | |
| u | 1031 | 3.1% | |
| y | 1018 | 3.1% | |
| c | 853 | 2.6% | |
| m | 700 | 2.1% | |
| g | 545 | 1.7% | |
| b | 368 | 1.1% | |
| k | 263 | 0.8% | |
| v | 235 | 0.7% | |
| f | 179 | 0.5% | |
| w | 141 | 0.4% | |
| z | 131 | 0.4% | |
| p | 126 | 0.4% | |
| x | 112 | 0.3% | |
| j | 85 | 0.3% | |
| Other values (13) | 48 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 1316 | 100.0% | |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| & | 316 | 86.6% | |
| . | 39 | 10.7% | |
| / | 4 | 1.1% | |
| ' | 3 | 0.8% | |
| , | 3 | 0.8% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 39 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 9 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 10 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 10 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 9 | 3 | 100.0% |
Most frequent Other Letter characters
| Value | Count | Frequency (%) | |
| 姿 | 1 | 50.0% | |
| 漪 | 1 | 50.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 40500 | 95.8% | |
| Common | 1753 | 4.1% | |
| Cyrillic | 4 | < 0.1% | |
| Han | 2 | < 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 4727 | 11.7% | |
| e | 3883 | 9.6% | |
| n | 3444 | 8.5% | |
| i | 2987 | 7.4% | |
| r | 2493 | 6.2% | |
| o | 2273 | 5.6% | |
| l | 2148 | 5.3% | |
| t | 1541 | 3.8% | |
| d | 1211 | 3.0% | |
| h | 1204 | 3.0% | |
| s | 1126 | 2.8% | |
| u | 1031 | 2.5% | |
| y | 1018 | 2.5% | |
| c | 853 | 2.1% | |
| A | 802 | 2.0% | |
| J | 788 | 1.9% | |
| m | 700 | 1.7% | |
| M | 618 | 1.5% | |
| S | 554 | 1.4% | |
| g | 545 | 1.3% | |
| B | 543 | 1.3% | |
| R | 520 | 1.3% | |
| K | 442 | 1.1% | |
| D | 439 | 1.1% | |
| C | 434 | 1.1% | |
| Other values (36) | 4176 | 10.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 1316 | 75.1% | |
| & | 316 | 18.0% | |
| . | 39 | 2.2% | |
| + | 39 | 2.2% | |
| ( | 10 | 0.6% | |
| ) | 10 | 0.6% | |
| - | 9 | 0.5% | |
| / | 4 | 0.2% | |
| ' | 3 | 0.2% | |
| , | 3 | 0.2% | |
| 9 | 3 | 0.2% | |
| ` | 1 | 0.1% |
Most frequent Han characters
| Value | Count | Frequency (%) | |
| 姿 | 1 | 50.0% | |
| 漪 | 1 | 50.0% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| Ю | 1 | 25.0% | |
| р | 1 | 25.0% | |
| и | 1 | 25.0% | |
| й | 1 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 42232 | 99.9% | |
| None | 20 | < 0.1% | |
| Cyrillic | 4 | < 0.1% | |
| CJK | 2 | < 0.1% | |
| Latin Ext Additional | 1 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 4727 | 11.2% | |
| e | 3883 | 9.2% | |
| n | 3444 | 8.2% | |
| i | 2987 | 7.1% | |
| r | 2493 | 5.9% | |
| o | 2273 | 5.4% | |
| l | 2148 | 5.1% | |
| t | 1541 | 3.6% | |
| (space) | 1316 | 3.1% | |
| d | 1211 | 2.9% | |
| h | 1204 | 2.9% | |
| s | 1126 | 2.7% | |
| u | 1031 | 2.4% | |
| y | 1018 | 2.4% | |
| c | 853 | 2.0% | |
| A | 802 | 1.9% | |
| J | 788 | 1.9% | |
| m | 700 | 1.7% | |
| M | 618 | 1.5% | |
| S | 554 | 1.3% | |
| g | 545 | 1.3% | |
| B | 543 | 1.3% | |
| R | 520 | 1.2% | |
| K | 442 | 1.0% | |
| D | 439 | 1.0% | |
| Other values (39) | 5026 | 11.9% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| è | 10 | 50.0% | |
| é | 4 | 20.0% | |
| ñ | 1 | 5.0% | |
| ư | 1 | 5.0% | |
| ơ | 1 | 5.0% | |
| ï | 1 | 5.0% | |
| ö | 1 | 5.0% | |
| á | 1 | 5.0% |
Most frequent Latin Ext Additional characters
| Value | Count | Frequency (%) | |
| ỹ | 1 | 100.0% |
Most frequent CJK characters
| Value | Count | Frequency (%) | |
| 姿 | 1 | 50.0% | |
| 漪 | 1 | 50.0% |
Most frequent Cyrillic characters
| Value | Count | Frequency (%) | |
| Ю | 1 | 25.0% | |
| р | 1 | 25.0% | |
| и | 1 | 25.0% | |
| й | 1 | 25.0% |
neighbourhood
| Distinct | 77 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.1 KiB |
| Near North Side | 748 |
|---|---|
| West Town | 730 |
| Lake View | 581 |
| Logan Square | 382 |
| Loop | 344 |
| Other values (72) |
| Value | Count | Frequency (%) | |
| Near North Side | 748 | 11.5% | |
| West Town | 730 | 11.2% | |
| Lake View | 581 | 8.9% | |
| Logan Square | 382 | 5.9% | |
| Loop | 344 | 5.3% | |
| Near West Side | 337 | 5.2% | |
| Lincoln Park | 313 | 4.8% | |
| Lower West Side | 190 | 2.9% | |
| Uptown | 182 | 2.8% | |
| Edgewater | 166 | 2.5% | |
| Irving Park | 158 | 2.4% | |
| Avondale | 140 | 2.1% | |
| Near South Side | 135 | 2.1% | |
| North Center | 127 | 1.9% | |
| Rogers Park | 120 | 1.8% | |
| Bridgeport | 115 | 1.8% | |
| Grand Boulevard | 112 | 1.7% | |
| Hyde Park | 104 | 1.6% | |
| East Garfield Park | 103 | 1.6% | |
| Lincoln Square | 92 | 1.4% | |
| Woodlawn | 91 | 1.4% | |
| South Shore | 84 | 1.3% | |
| West Ridge | 81 | 1.2% | |
| Portage Park | 79 | 1.2% | |
| Armour Square | 77 | 1.2% | |
| Other values (52) | 932 | 14.3% |
Frequencies of value counts
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 22 |
|---|---|
| Median length | 11 |
| Mean length | 11.13552047 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 8021 | 11.0% | |
| (space) | 6764 | 9.3% | |
| r | 6010 | 8.3% | |
| a | 5516 | 7.6% | |
| o | 5341 | 7.4% | |
| t | 3720 | 5.1% | |
| n | 3495 | 4.8% | |
| i | 3226 | 4.4% | |
| d | 2803 | 3.9% | |
| S | 2333 | 3.2% | |
| N | 2196 | 3.0% | |
| w | 2148 | 3.0% | |
| L | 1991 | 2.7% | |
| s | 1985 | 2.7% | |
| k | 1852 | 2.5% | |
| W | 1508 | 2.1% | |
| h | 1480 | 2.0% | |
| g | 1442 | 2.0% | |
| l | 1330 | 1.8% | |
| P | 1319 | 1.8% | |
| u | 1315 | 1.8% | |
| T | 730 | 1.0% | |
| p | 641 | 0.9% | |
| V | 581 | 0.8% | |
| q | 551 | 0.8% | |
| Other values (21) | 4339 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 52586 | 72.4% | |
| Uppercase Letter | 13287 | 18.3% | |
| Space Separator | 6764 | 9.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 2333 | 17.6% | |
| N | 2196 | 16.5% | |
| L | 1991 | 15.0% | |
| W | 1508 | 11.3% | |
| P | 1319 | 9.9% | |
| T | 730 | 5.5% | |
| V | 581 | 4.4% | |
| G | 364 | 2.7% | |
| A | 353 | 2.7% | |
| E | 296 | 2.2% | |
| B | 284 | 2.1% | |
| C | 281 | 2.1% | |
| H | 229 | 1.7% | |
| R | 223 | 1.7% | |
| U | 182 | 1.4% | |
| I | 158 | 1.2% | |
| D | 97 | 0.7% | |
| K | 44 | 0.3% | |
| M | 41 | 0.3% | |
| J | 38 | 0.3% | |
| O | 22 | 0.2% | |
| F | 17 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 8021 | 15.3% | |
| r | 6010 | 11.4% | |
| a | 5516 | 10.5% | |
| o | 5341 | 10.2% | |
| t | 3720 | 7.1% | |
| n | 3495 | 6.6% | |
| i | 3226 | 6.1% | |
| d | 2803 | 5.3% | |
| w | 2148 | 4.1% | |
| s | 1985 | 3.8% | |
| k | 1852 | 3.5% | |
| h | 1480 | 2.8% | |
| g | 1442 | 2.7% | |
| l | 1330 | 2.5% | |
| u | 1315 | 2.5% | |
| p | 641 | 1.2% | |
| q | 551 | 1.0% | |
| c | 476 | 0.9% | |
| v | 420 | 0.8% | |
| m | 243 | 0.5% | |
| y | 210 | 0.4% | |
| f | 208 | 0.4% | |
| b | 153 | 0.3% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 6764 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 65873 | 90.7% | |
| Common | 6764 | 9.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 8021 | 12.2% | |
| r | 6010 | 9.1% | |
| a | 5516 | 8.4% | |
| o | 5341 | 8.1% | |
| t | 3720 | 5.6% | |
| n | 3495 | 5.3% | |
| i | 3226 | 4.9% | |
| d | 2803 | 4.3% | |
| S | 2333 | 3.5% | |
| N | 2196 | 3.3% | |
| w | 2148 | 3.3% | |
| L | 1991 | 3.0% | |
| s | 1985 | 3.0% | |
| k | 1852 | 2.8% | |
| W | 1508 | 2.3% | |
| h | 1480 | 2.2% | |
| g | 1442 | 2.2% | |
| l | 1330 | 2.0% | |
| P | 1319 | 2.0% | |
| u | 1315 | 2.0% | |
| T | 730 | 1.1% | |
| p | 641 | 1.0% | |
| V | 581 | 0.9% | |
| q | 551 | 0.8% | |
| c | 476 | 0.7% | |
| Other values (20) | 3863 | 5.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 6764 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 72637 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 8021 | 11.0% | |
| (space) | 6764 | 9.3% | |
| r | 6010 | 8.3% | |
| a | 5516 | 7.6% | |
| o | 5341 | 7.4% | |
| t | 3720 | 5.1% | |
| n | 3495 | 4.8% | |
| i | 3226 | 4.4% | |
| d | 2803 | 3.9% | |
| S | 2333 | 3.2% | |
| N | 2196 | 3.0% | |
| w | 2148 | 3.0% | |
| L | 1991 | 2.7% | |
| s | 1985 | 2.7% | |
| k | 1852 | 2.5% | |
| W | 1508 | 2.1% | |
| h | 1480 | 2.0% | |
| g | 1442 | 2.0% | |
| l | 1330 | 1.8% | |
| P | 1319 | 1.8% | |
| u | 1315 | 1.8% | |
| T | 730 | 1.0% | |
| p | 641 | 0.9% | |
| V | 581 | 0.8% | |
| q | 551 | 0.8% | |
| Other values (21) | 4339 | 6.0% |
latitude
Real number (ℝ≥0)
| Distinct | 5168 |
|---|---|
| Distinct (%) | 79.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.89871965 |
|---|---|
| Minimum | 41.65156 |
| Maximum | 42.02259 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 51.1 KiB |
Quantile statistics
| Minimum | 41.65156 |
|---|---|
| 5-th percentile | 41.782255 |
| Q1 | 41.87348 |
| median | 41.90143 |
| Q3 | 41.939765 |
| 95-th percentile | 41.987144 |
| Maximum | 42.02259 |
| Range | 0.37103 |
| Interquartile range (IQR) | 0.066285 |
Descriptive statistics
| Standard deviation | 0.05904695304 |
|---|---|
| Coefficient of variation (CV) | 0.00140927822 |
| Kurtosis | 0.815250805 |
| Mean | 41.89871965 |
| Median Absolute Deviation (MAD) | 0.03475 |
| Skewness | -0.7368105194 |
| Sum | 273305.3483 |
| Variance | 0.003486542663 |
| Monotonicity | Not monotonic |
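As a quick consistency check, the derived latitude figures above can be recomputed from the reported mean, standard deviation, quantiles, and extremes. This is a minimal sketch that hard-codes values copied from the tables; the variable names are illustrative, and it does not claim to reproduce how pandas-profiling computes these statistics internally.

```python
import math

# Values copied from the latitude tables in this report.
n = 6523                      # number of observations
mean = 41.89871965
std = 0.05904695304
q1, q3 = 41.87348, 41.939765
minimum, maximum = 41.65156, 42.02259

cv = std / mean               # coefficient of variation
variance = std ** 2           # variance is the squared standard deviation
value_range = maximum - minimum
iqr = q3 - q1                 # interquartile range
total = mean * n              # sum of all values (no missing cells here)

print(f"CV={cv:.10f} variance={variance:.10f}")
print(f"range={value_range:.5f} IQR={iqr:.6f} sum={total:.4f}")
```

Each recomputed figure should agree with the corresponding table entry up to rounding of the printed inputs.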
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 41.88306 | 35 | 0.5% | |
| 41.88608 | 31 | 0.5% | |
| 41.89111 | 30 | 0.5% | |
| 42.01653 | 18 | 0.3% | |
| 41.88558 | 16 | 0.2% | |
| 41.89063 | 13 | 0.2% | |
| 41.88302 | 12 | 0.2% | |
| 41.8989 | 12 | 0.2% | |
| 41.89622 | 12 | 0.2% | |
| 41.89862 | 11 | 0.2% | |
| 41.89235 | 11 | 0.2% | |
| 41.88606 | 11 | 0.2% | |
| 41.94041 | 11 | 0.2% | |
| 41.88309 | 10 | 0.2% | |
| 41.89453 | 9 | 0.1% | |
| 41.90323 | 8 | 0.1% | |
| 41.89502 | 8 | 0.1% | |
| 41.90895 | 8 | 0.1% | |
| 41.87711 | 7 | 0.1% | |
| 41.87723 | 7 | 0.1% | |
| 41.90653 | 7 | 0.1% | |
| 41.89621 | 7 | 0.1% | |
| 41.89902 | 7 | 0.1% | |
| 41.89988 | 6 | 0.1% | |
| 41.92214 | 6 | 0.1% | |
| Other values (5143) | 6210 | 95.2% |
| Value | Count | Frequency (%) | |
| 41.65156 | 1 | < 0.1% | |
| 41.65241 | 1 | < 0.1% | |
| 41.65301 | 1 | < 0.1% | |
| 41.65367 | 1 | < 0.1% | |
| 41.65648 | 1 | < 0.1% | |
| 41.66519 | 1 | < 0.1% | |
| 41.68582 | 1 | < 0.1% | |
| 41.68605 | 1 | < 0.1% | |
| 41.68612 | 1 | < 0.1% | |
| 41.68793 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 42.02259 | 1 | < 0.1% | |
| 42.02211 | 1 | < 0.1% | |
| 42.02197 | 1 | < 0.1% | |
| 42.02158 | 1 | < 0.1% | |
| 42.02139 | 1 | < 0.1% | |
| 42.02132 | 1 | < 0.1% | |
| 42.02092 | 1 | < 0.1% | |
| 42.02074 | 1 | < 0.1% | |
| 42.02073 | 1 | < 0.1% | |
| 42.02026 | 1 | < 0.1% |
longitude
Real number (ℝ)
| Distinct | 4981 |
|---|---|
| Distinct (%) | 76.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -87.66339754 |
|---|---|
| Minimum | -87.93434 |
| Maximum | -87.53782 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 51.1 KiB |
Quantile statistics
| Minimum | -87.93434 |
|---|---|
| 5-th percentile | -87.735849 |
| Q1 | -87.68666 |
| median | -87.65959 |
| Q3 | -87.632985 |
| 95-th percentile | -87.604728 |
| Maximum | -87.53782 |
| Range | 0.39652 |
| Interquartile range (IQR) | 0.053675 |
Descriptive statistics
| Standard deviation | 0.04238696923 |
|---|---|
| Coefficient of variation (CV) | -0.0004835195808 |
| Kurtosis | 1.407428092 |
| Mean | -87.66339754 |
| Median Absolute Deviation (MAD) | 0.02671 |
| Skewness | -0.7005122763 |
| Sum | -571828.3422 |
| Variance | 0.001796655161 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| -87.63422 | 31 | 0.5% | |
| -87.62205 | 30 | 0.5% | |
| -87.65131 | 30 | 0.5% | |
| -87.6257 | 16 | 0.2% | |
| -87.61903 | 13 | 0.2% | |
| -87.68778 | 12 | 0.2% | |
| -87.62472 | 12 | 0.2% | |
| -87.62571 | 11 | 0.2% | |
| -87.62832 | 10 | 0.2% | |
| -87.62138 | 10 | 0.2% | |
| -87.62732 | 10 | 0.2% | |
| -87.63341 | 9 | 0.1% | |
| -87.62797 | 9 | 0.1% | |
| -87.63385 | 8 | 0.1% | |
| -87.7096 | 8 | 0.1% | |
| -87.62901 | 7 | 0.1% | |
| -87.6415 | 7 | 0.1% | |
| -87.65048 | 7 | 0.1% | |
| -87.62785 | 7 | 0.1% | |
| -87.62791 | 7 | 0.1% | |
| -87.62577 | 6 | 0.1% | |
| -87.66777 | 6 | 0.1% | |
| -87.63009 | 6 | 0.1% | |
| -87.6525 | 6 | 0.1% | |
| -87.67371 | 6 | 0.1% | |
| Other values (4956) | 6239 | 95.6% |
| Value | Count | Frequency (%) | |
| -87.93434 | 1 | < 0.1% | |
| -87.84674 | 1 | < 0.1% | |
| -87.84546 | 1 | < 0.1% | |
| -87.8443 | 1 | < 0.1% | |
| -87.84321 | 1 | < 0.1% | |
| -87.84193 | 1 | < 0.1% | |
| -87.84132 | 1 | < 0.1% | |
| -87.83699 | 1 | < 0.1% | |
| -87.83561 | 1 | < 0.1% | |
| -87.83528 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| -87.53782 | 1 | < 0.1% | |
| -87.53942 | 1 | < 0.1% | |
| -87.54165 | 1 | < 0.1% | |
| -87.54423 | 1 | < 0.1% | |
| -87.54424 | 1 | < 0.1% | |
| -87.54542 | 1 | < 0.1% | |
| -87.54559 | 1 | < 0.1% | |
| -87.54561 | 1 | < 0.1% | |
| -87.54571 | 1 | < 0.1% | |
| -87.54597 | 1 | < 0.1% |
room_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.1 KiB |
| Entire home/apt | 4510 |
|---|---|
| Private room | 1848 |
| Shared room | 94 |
| Hotel room | 71 |
| Value | Count | Frequency (%) | |
| Entire home/apt | 4510 | 69.1% | |
| Private room | 1848 | 28.3% | |
| Shared room | 94 | 1.4% | |
| Hotel room | 71 | 1.1% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.03801932 |
| Min length | 10 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 11033 | 12.0% | |
| t | 10939 | 11.9% | |
| o | 8607 | 9.4% | |
| r | 8465 | 9.2% | |
| (space) | 6523 | 7.1% | |
| m | 6523 | 7.1% | |
| a | 6452 | 7.0% | |
| i | 6358 | 6.9% | |
| h | 4604 | 5.0% | |
| E | 4510 | 4.9% | |
| n | 4510 | 4.9% | |
| / | 4510 | 4.9% | |
| p | 4510 | 4.9% | |
| P | 1848 | 2.0% | |
| v | 1848 | 2.0% | |
| S | 94 | 0.1% | |
| d | 94 | 0.1% | |
| H | 71 | 0.1% | |
| l | 71 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 74014 | 80.8% | |
| Uppercase Letter | 6523 | 7.1% | |
| Space Separator | 6523 | 7.1% | |
| Other Punctuation | 4510 | 4.9% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| E | 4510 | 69.1% | |
| P | 1848 | 28.3% | |
| S | 94 | 1.4% | |
| H | 71 | 1.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 11033 | 14.9% | |
| t | 10939 | 14.8% | |
| o | 8607 | 11.6% | |
| r | 8465 | 11.4% | |
| m | 6523 | 8.8% | |
| a | 6452 | 8.7% | |
| i | 6358 | 8.6% | |
| h | 4604 | 6.2% | |
| n | 4510 | 6.1% | |
| p | 4510 | 6.1% | |
| v | 1848 | 2.5% | |
| d | 94 | 0.1% | |
| l | 71 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 6523 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 4510 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 80537 | 88.0% | |
| Common | 11033 | 12.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 11033 | 13.7% | |
| t | 10939 | 13.6% | |
| o | 8607 | 10.7% | |
| r | 8465 | 10.5% | |
| m | 6523 | 8.1% | |
| a | 6452 | 8.0% | |
| i | 6358 | 7.9% | |
| h | 4604 | 5.7% | |
| E | 4510 | 5.6% | |
| n | 4510 | 5.6% | |
| p | 4510 | 5.6% | |
| P | 1848 | 2.3% | |
| v | 1848 | 2.3% | |
| S | 94 | 0.1% | |
| d | 94 | 0.1% | |
| H | 71 | 0.1% | |
| l | 71 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 6523 | 59.1% | |
| / | 4510 | 40.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 91570 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 11033 | 12.0% | |
| t | 10939 | 11.9% | |
| o | 8607 | 9.4% | |
| r | 8465 | 9.2% | |
| (space) | 6523 | 7.1% | |
| m | 6523 | 7.1% | |
| a | 6452 | 7.0% | |
| i | 6358 | 6.9% | |
| h | 4604 | 5.0% | |
| E | 4510 | 4.9% | |
| n | 4510 | 4.9% | |
| / | 4510 | 4.9% | |
| p | 4510 | 4.9% | |
| P | 1848 | 2.0% | |
| v | 1848 | 2.0% | |
| S | 94 | 0.1% | |
| d | 94 | 0.1% | |
| H | 71 | 0.1% | |
| l | 71 | 0.1% |
price
Real number (ℝ≥0)
| Distinct | 504 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 150.062088 |
|---|---|
| Minimum | 0 |
| Maximum | 10000 |
| Zeros | 5 |
| Zeros (%) | 0.1% |
| Memory size | 51.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 30 |
| Q1 | 60 |
| median | 94 |
| Q3 | 150 |
| 95-th percentile | 400 |
| Maximum | 10000 |
| Range | 10000 |
| Interquartile range (IQR) | 90 |
Descriptive statistics
| Standard deviation | 371.5814529 |
|---|---|
| Coefficient of variation (CV) | 2.476184743 |
| Kurtosis | 507.6916875 |
| Mean | 150.062088 |
| Median Absolute Deviation (MAD) | 42 |
| Skewness | 20.25399533 |
| Sum | 978855 |
| Variance | 138072.7761 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 75 | 167 | 2.6% | |
| 50 | 144 | 2.2% | |
| 100 | 138 | 2.1% | |
| 65 | 119 | 1.8% | |
| 60 | 114 | 1.7% | |
| 80 | 114 | 1.7% | |
| 70 | 110 | 1.7% | |
| 150 | 110 | 1.7% | |
| 45 | 107 | 1.6% | |
| 85 | 100 | 1.5% | |
| 99 | 96 | 1.5% | |
| 200 | 91 | 1.4% | |
| 35 | 88 | 1.3% | |
| 90 | 86 | 1.3% | |
| 55 | 86 | 1.3% | |
| 125 | 82 | 1.3% | |
| 95 | 81 | 1.2% | |
| 120 | 74 | 1.1% | |
| 110 | 70 | 1.1% | |
| 89 | 68 | 1.0% | |
| 49 | 67 | 1.0% | |
| 40 | 66 | 1.0% | |
| 30 | 65 | 1.0% | |
| 400 | 65 | 1.0% | |
| 59 | 64 | 1.0% | |
| Other values (479) | 4151 | 63.6% |
| Value | Count | Frequency (%) | |
| 0 | 5 | 0.1% | |
| 10 | 3 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 14 | 4 | 0.1% | |
| 15 | 8 | 0.1% | |
| 16 | 6 | 0.1% | |
| 17 | 9 | 0.1% | |
| 18 | 5 | 0.1% | |
| 19 | 7 | 0.1% | |
| 20 | 14 | 0.2% |
| Value | Count | Frequency (%) | |
| 10000 | 1 | < 0.1% | |
| 9999 | 5 | 0.1% | |
| 9000 | 1 | < 0.1% | |
| 3690 | 1 | < 0.1% | |
| 3500 | 1 | < 0.1% | |
| 3429 | 1 | < 0.1% | |
| 3070 | 1 | < 0.1% | |
| 3000 | 1 | < 0.1% | |
| 2773 | 1 | < 0.1% | |
| 2507 | 1 | < 0.1% |
minimum_nights
Real number (ℝ≥0)
| Distinct | 61 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.231488579 |
|---|---|
| Minimum | 1 |
| Maximum | 500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 51.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 32 |
| Maximum | 500 |
| Range | 499 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 22.38369529 |
|---|---|
| Coefficient of variation (CV) | 2.719276723 |
| Kurtosis | 161.0444242 |
| Mean | 8.231488579 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 10.65131132 |
| Sum | 53694 |
| Variance | 501.0298148 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 2 | 2121 | 32.5% | |
| 1 | 2000 | 30.7% | |
| 3 | 765 | 11.7% | |
| 30 | 398 | 6.1% | |
| 4 | 175 | 2.7% | |
| 7 | 165 | 2.5% | |
| 31 | 141 | 2.2% | |
| 32 | 135 | 2.1% | |
| 5 | 120 | 1.8% | |
| 14 | 68 | 1.0% | |
| 33 | 60 | 0.9% | |
| 60 | 51 | 0.8% | |
| 10 | 51 | 0.8% | |
| 28 | 47 | 0.7% | |
| 6 | 31 | 0.5% | |
| 90 | 19 | 0.3% | |
| 20 | 18 | 0.3% | |
| 15 | 17 | 0.3% | |
| 41 | 16 | 0.2% | |
| 21 | 12 | 0.2% | |
| 365 | 11 | 0.2% | |
| 8 | 10 | 0.2% | |
| 29 | 9 | 0.1% | |
| 25 | 8 | 0.1% | |
| 27 | 7 | 0.1% | |
| Other values (36) | 68 | 1.0% |
| Value | Count | Frequency (%) | |
| 1 | 2000 | 30.7% | |
| 2 | 2121 | 32.5% | |
| 3 | 765 | 11.7% | |
| 4 | 175 | 2.7% | |
| 5 | 120 | 1.8% | |
| 6 | 31 | 0.5% | |
| 7 | 165 | 2.5% | |
| 8 | 10 | 0.2% | |
| 9 | 3 | < 0.1% | |
| 10 | 51 | 0.8% |
| Value | Count | Frequency (%) | |
| 500 | 1 | < 0.1% | |
| 365 | 11 | 0.2% | |
| 360 | 1 | < 0.1% | |
| 210 | 1 | < 0.1% | |
| 200 | 1 | < 0.1% | |
| 185 | 1 | < 0.1% | |
| 182 | 1 | < 0.1% | |
| 180 | 7 | 0.1% | |
| 179 | 1 | < 0.1% | |
| 168 | 1 | < 0.1% |
number_of_reviews
Real number (ℝ≥0)
| Distinct | 338 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.67162349 |
|---|---|
| Minimum | 0 |
| Maximum | 655 |
| Zeros | 1285 |
| Zeros (%) | 19.7% |
| Memory size | 51.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 13 |
| Q3 | 53 |
| 95-th percentile | 176.9 |
| Maximum | 655 |
| Range | 655 |
| Interquartile range (IQR) | 52 |
Descriptive statistics
| Standard deviation | 67.2569877 |
|---|---|
| Coefficient of variation (CV) | 1.6139757 |
| Kurtosis | 12.65643683 |
| Mean | 41.67162349 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 2.988450645 |
| Sum | 271824 |
| Variance | 4523.502395 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 1285 | 19.7% | |
| 1 | 467 | 7.2% | |
| 2 | 276 | 4.2% | |
| 3 | 236 | 3.6% | |
| 4 | 178 | 2.7% | |
| 5 | 141 | 2.2% | |
| 6 | 127 | 1.9% | |
| 7 | 106 | 1.6% | |
| 8 | 100 | 1.5% | |
| 9 | 84 | 1.3% | |
| 13 | 83 | 1.3% | |
| 10 | 83 | 1.3% | |
| 12 | 78 | 1.2% | |
| 16 | 70 | 1.1% | |
| 11 | 63 | 1.0% | |
| 24 | 58 | 0.9% | |
| 20 | 58 | 0.9% | |
| 18 | 57 | 0.9% | |
| 15 | 57 | 0.9% | |
| 14 | 56 | 0.9% | |
| 25 | 56 | 0.9% | |
| 19 | 55 | 0.8% | |
| 21 | 54 | 0.8% | |
| 23 | 53 | 0.8% | |
| 17 | 53 | 0.8% | |
| Other values (313) | 2589 | 39.7% |
| Value | Count | Frequency (%) | |
| 0 | 1285 | 19.7% | |
| 1 | 467 | 7.2% | |
| 2 | 276 | 4.2% | |
| 3 | 236 | 3.6% | |
| 4 | 178 | 2.7% | |
| 5 | 141 | 2.2% | |
| 6 | 127 | 1.9% | |
| 7 | 106 | 1.6% | |
| 8 | 100 | 1.5% | |
| 9 | 84 | 1.3% |
| Value | Count | Frequency (%) | |
| 655 | 1 | < 0.1% | |
| 641 | 1 | < 0.1% | |
| 626 | 1 | < 0.1% | |
| 570 | 1 | < 0.1% | |
| 541 | 1 | < 0.1% | |
| 524 | 1 | < 0.1% | |
| 518 | 1 | < 0.1% | |
| 513 | 1 | < 0.1% | |
| 508 | 1 | < 0.1% | |
| 505 | 1 | < 0.1% |
last_review
Categorical
| Distinct | 820 |
|---|---|
| Distinct (%) | 15.7% |
| Missing | 1285 |
| Missing (%) | 19.7% |
| Memory size | 51.1 KiB |
| 11/29/20 | 159 |
|---|---|
| 12/13/20 | 117 |
| 3/15/20 | 110 |
| 11/30/20 | 95 |
| 2/16/20 | 95 |
| Other values (815) | 4662 |
| Value | Count | Frequency (%) | |
| 11/29/20 | 159 | 2.4% | |
| 12/13/20 | 117 | 1.8% | |
| 3/15/20 | 110 | 1.7% | |
| 11/30/20 | 95 | 1.5% | |
| 2/16/20 | 95 | 1.5% | |
| 11/28/20 | 93 | 1.4% | |
| 12/6/20 | 80 | 1.2% | |
| 11/15/20 | 68 | 1.0% | |
| 11/8/20 | 67 | 1.0% | |
| 10/25/20 | 63 | 1.0% | |
| 11/22/20 | 62 | 1.0% | |
| 12/14/20 | 59 | 0.9% | |
| 12/5/20 | 51 | 0.8% | |
| 10/18/20 | 49 | 0.8% | |
| 10/20/19 | 48 | 0.7% | |
| 12/7/20 | 45 | 0.7% | |
| 12/15/20 | 45 | 0.7% | |
| 11/27/20 | 44 | 0.7% | |
| 2/17/20 | 44 | 0.7% | |
| 10/31/20 | 43 | 0.7% | |
| 11/23/20 | 43 | 0.7% | |
| 3/16/20 | 43 | 0.7% | |
| 10/11/20 | 43 | 0.7% | |
| 12/1/20 | 41 | 0.6% | |
| 11/21/20 | 40 | 0.6% | |
| Other values (795) | 3591 | 55.1% | |
| (Missing) | 1285 | 19.7% |
Frequencies of value counts
Unique
| Unique | 340 |
|---|---|
| Unique (%) | 6.5% |
Histogram of lengths of the category
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.463437069 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| / | 10476 | 24.8% | |
| 1 | 8261 | 19.6% | |
| 2 | 7212 | 17.1% | |
| 0 | 5425 | 12.9% | |
| n | 2570 | 6.1% | |
| 9 | 1782 | 4.2% | |
| 3 | 1292 | 3.1% | |
| a | 1285 | 3.0% | |
| 8 | 1008 | 2.4% | |
| 7 | 795 | 1.9% | |
| 6 | 766 | 1.8% | |
| 5 | 725 | 1.7% | |
| 4 | 564 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 27830 | 66.0% | |
| Other Punctuation | 10476 | 24.8% | |
| Lowercase Letter | 3855 | 9.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 8261 | 29.7% | |
| 2 | 7212 | 25.9% | |
| 0 | 5425 | 19.5% | |
| 9 | 1782 | 6.4% | |
| 3 | 1292 | 4.6% | |
| 8 | 1008 | 3.6% | |
| 7 | 795 | 2.9% | |
| 6 | 766 | 2.8% | |
| 5 | 725 | 2.6% | |
| 4 | 564 | 2.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 10476 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 2570 | 66.7% | |
| a | 1285 | 33.3% |
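The lowercase letters counted above are consistent with the 1285 missing values having been converted to the string "nan" before character counting: 2 × 1285 = 2570 occurrences of "n" and 1285 of "a", with a minimum length of 3. This is an inference from the numbers rather than something the report states. The sketch below uses a small made-up list (not the real column) to show how stringified missing values leak into character statistics.

```python
from collections import Counter

# Hypothetical mini-sample: two dates and two missing values (float NaN).
values = ["11/29/20", float("nan"), "3/15/20", float("nan")]

# Converting everything to text turns each missing value into "nan".
as_text = [str(v) for v in values]

# Count characters across all stringified entries.
char_counts = Counter("".join(as_text))
shortest = min(len(s) for s in as_text)

print(char_counts["n"], char_counts["a"], shortest)
```

Parsing the column as dates, or dropping missing values before text analysis, would avoid this effect.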
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 38306 | 90.9% | |
| Latin | 3855 | 9.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| / | 10476 | 27.3% | |
| 1 | 8261 | 21.6% | |
| 2 | 7212 | 18.8% | |
| 0 | 5425 | 14.2% | |
| 9 | 1782 | 4.7% | |
| 3 | 1292 | 3.4% | |
| 8 | 1008 | 2.6% | |
| 7 | 795 | 2.1% | |
| 6 | 766 | 2.0% | |
| 5 | 725 | 1.9% | |
| 4 | 564 | 1.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 2570 | 66.7% | |
| a | 1285 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 42161 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| / | 10476 | 24.8% | |
| 1 | 8261 | 19.6% | |
| 2 | 7212 | 17.1% | |
| 0 | 5425 | 12.9% | |
| n | 2570 | 6.1% | |
| 9 | 1782 | 4.2% | |
| 3 | 1292 | 3.1% | |
| a | 1285 | 3.0% | |
| 8 | 1008 | 2.4% | |
| 7 | 795 | 1.9% | |
| 6 | 766 | 1.8% | |
| 5 | 725 | 1.7% | |
| 4 | 564 | 1.3% |
reviews_per_month
Real number (ℝ≥0)
| Distinct | 644 |
|---|---|
| Distinct (%) | 12.3% |
| Missing | 1285 |
| Missing (%) | 19.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.65593929 |
|---|---|
| Minimum | 0.01 |
| Maximum | 32.41 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 51.1 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.08 |
| Q1 | 0.39 |
| median | 1.12 |
| Q3 | 2.45 |
| 95-th percentile | 4.7815 |
| Maximum | 32.41 |
| Range | 32.4 |
| Interquartile range (IQR) | 2.06 |
Descriptive statistics
| Standard deviation | 1.727131468 |
|---|---|
| Coefficient of variation (CV) | 1.042992022 |
| Kurtosis | 31.03968472 |
| Mean | 1.65593929 |
| Median Absolute Deviation (MAD) | 0.87 |
| Skewness | 3.231401101 |
| Sum | 8673.81 |
| Variance | 2.982983106 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 1 | 111 | 1.7% | |
| 0.1 | 55 | 0.8% | |
| 0.06 | 52 | 0.8% | |
| 0.11 | 51 | 0.8% | |
| 0.19 | 51 | 0.8% | |
| 0.07 | 50 | 0.8% | |
| 0.08 | 49 | 0.8% | |
| 0.16 | 49 | 0.8% | |
| 0.21 | 47 | 0.7% | |
| 0.03 | 44 | 0.7% | |
| 0.13 | 44 | 0.7% | |
| 0.17 | 44 | 0.7% | |
| 0.09 | 43 | 0.7% | |
| 0.05 | 38 | 0.6% | |
| 0.14 | 38 | 0.6% | |
| 0.29 | 37 | 0.6% | |
| 0.12 | 37 | 0.6% | |
| 0.2 | 35 | 0.5% | |
| 0.31 | 34 | 0.5% | |
| 0.38 | 32 | 0.5% | |
| 0.18 | 32 | 0.5% | |
| 0.25 | 32 | 0.5% | |
| 0.28 | 31 | 0.5% | |
| 2 | 30 | 0.5% | |
| 0.04 | 29 | 0.4% | |
| Other values (619) | 4143 | 63.5% | |
| (Missing) | 1285 | 19.7% |
| Value | Count | Frequency (%) | |
| 0.01 | 1 | < 0.1% | |
| 0.02 | 20 | 0.3% | |
| 0.03 | 44 | 0.7% | |
| 0.04 | 29 | 0.4% | |
| 0.05 | 38 | 0.6% | |
| 0.06 | 52 | 0.8% | |
| 0.07 | 50 | 0.8% | |
| 0.08 | 49 | 0.8% | |
| 0.09 | 43 | 0.7% | |
| 0.1 | 55 | 0.8% |
| Value | Count | Frequency (%) | |
| 32.41 | 1 | < 0.1% | |
| 24.34 | 1 | < 0.1% | |
| 19.56 | 1 | < 0.1% | |
| 17.16 | 1 | < 0.1% | |
| 13.86 | 1 | < 0.1% | |
| 13.74 | 1 | < 0.1% | |
| 12.08 | 1 | < 0.1% | |
| 11.35 | 1 | < 0.1% | |
| 11.03 | 1 | < 0.1% | |
| 11 | 1 | < 0.1% |
calculated_host_listings_count
Real number (ℝ≥0)
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.44718688 |
|---|---|
| Minimum | 1 |
| Maximum | 216 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 51.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 8 |
| 95-th percentile | 63 |
| Maximum | 216 |
| Range | 215 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 39.62176771 |
|---|---|
| Coefficient of variation (CV) | 2.742524759 |
| Kurtosis | 19.29870307 |
| Mean | 14.44718688 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.413201886 |
| Sum | 94239 |
| Variance | 1569.884477 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=34)
| Value | Count | Frequency (%) | |
| 1 | 2708 | 41.5% | |
| 2 | 860 | 13.2% | |
| 3 | 441 | 6.8% | |
| 4 | 376 | 5.8% | |
| 5 | 235 | 3.6% | |
| 216 | 216 | 3.3% | |
| 6 | 174 | 2.7% | |
| 12 | 120 | 1.8% | |
| 9 | 117 | 1.8% | |
| 8 | 96 | 1.5% | |
| 7 | 91 | 1.4% | |
| 40 | 80 | 1.2% | |
| 74 | 74 | 1.1% | |
| 10 | 70 | 1.1% | |
| 11 | 66 | 1.0% | |
| 63 | 63 | 1.0% | |
| 15 | 60 | 0.9% | |
| 20 | 60 | 0.9% | |
| 30 | 60 | 0.9% | |
| 18 | 54 | 0.8% | |
| 25 | 50 | 0.8% | |
| 50 | 50 | 0.8% | |
| 49 | 49 | 0.8% | |
| 16 | 48 | 0.7% | |
| 47 | 47 | 0.7% | |
| Other values (9) | 258 | 4.0% |
| Value | Count | Frequency (%) | |
| 1 | 2708 | 41.5% | |
| 2 | 860 | 13.2% | |
| 3 | 441 | 6.8% | |
| 4 | 376 | 5.8% | |
| 5 | 235 | 3.6% | |
| 6 | 174 | 2.7% | |
| 7 | 91 | 1.4% | |
| 8 | 96 | 1.5% | |
| 9 | 117 | 1.8% | |
| 10 | 70 | 1.1% |
| Value | Count | Frequency (%) | |
| 216 | 216 | 3.3% | |
| 74 | 74 | 1.1% | |
| 63 | 63 | 1.0% | |
| 50 | 50 | 0.8% | |
| 49 | 49 | 0.8% | |
| 47 | 47 | 0.7% | |
| 40 | 80 | 1.2% | |
| 35 | 35 | 0.5% | |
| 31 | 31 | 0.5% | |
| 30 | 60 | 0.9% |
availability_365
Real number (ℝ≥0)
| Distinct | 361 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 160.5874598 |
|---|---|
| Minimum | 0 |
| Maximum | 365 |
| Zeros | 1797 |
| Zeros (%) | 27.5% |
| Memory size | 51.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 123 |
| Q3 | 333 |
| 95-th percentile | 365 |
| Maximum | 365 |
| Range | 365 |
| Interquartile range (IQR) | 333 |
Descriptive statistics
| Standard deviation | 144.3194377 |
|---|---|
| Coefficient of variation (CV) | 0.8986968094 |
| Kurtosis | -1.550568682 |
| Mean | 160.5874598 |
| Median Absolute Deviation (MAD) | 123 |
| Skewness | 0.271537519 |
| Sum | 1047512 |
| Variance | 20828.1001 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 1797 | 27.5% | |
| 365 | 507 | 7.8% | |
| 90 | 164 | 2.5% | |
| 364 | 162 | 2.5% | |
| 180 | 126 | 1.9% | |
| 353 | 106 | 1.6% | |
| 89 | 101 | 1.5% | |
| 78 | 89 | 1.4% | |
| 363 | 82 | 1.3% | |
| 179 | 74 | 1.1% | |
| 88 | 67 | 1.0% | |
| 360 | 61 | 0.9% | |
| 168 | 50 | 0.8% | |
| 362 | 49 | 0.8% | |
| 358 | 46 | 0.7% | |
| 361 | 46 | 0.7% | |
| 322 | 44 | 0.7% | |
| 178 | 43 | 0.7% | |
| 354 | 38 | 0.6% | |
| 359 | 36 | 0.6% | |
| 352 | 35 | 0.5% | |
| 356 | 34 | 0.5% | |
| 351 | 33 | 0.5% | |
| 294 | 33 | 0.5% | |
| 83 | 33 | 0.5% | |
| Other values (336) | 2667 | 40.9% |
| Value | Count | Frequency (%) | |
| 0 | 1797 | 27.5% | |
| 1 | 27 | 0.4% | |
| 2 | 4 | 0.1% | |
| 3 | 6 | 0.1% | |
| 4 | 7 | 0.1% | |
| 5 | 6 | 0.1% | |
| 6 | 10 | 0.2% | |
| 7 | 9 | 0.1% | |
| 8 | 6 | 0.1% | |
| 9 | 4 | 0.1% |
| Value | Count | Frequency (%) | |
| 365 | 507 | 7.8% | |
| 364 | 162 | 2.5% | |
| 363 | 82 | 1.3% | |
| 362 | 49 | 0.8% | |
| 361 | 46 | 0.7% | |
| 360 | 61 | 0.9% | |
| 359 | 36 | 0.6% | |
| 358 | 46 | 0.7% | |
| 357 | 32 | 0.5% | |
| 356 | 34 | 0.5% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
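The calculation described above can be sketched in a few lines. This is an illustrative example, not the report's own code: the helper name `pearson_r` is hypothetical, and the inputs are the `price` and `number_of_reviews` values of the first five listings shown under "First rows".

```python
import numpy as np

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    cov_xy = np.cov(x, y, ddof=1)[0, 1]          # sample covariance
    return cov_xy / (x.std(ddof=1) * y.std(ddof=1))

# First five listings from the "First rows" table.
price = np.array([70, 95, 60, 65, 20], dtype=float)
reviews = np.array([181, 395, 387, 53, 45], dtype=float)

r = pearson_r(price, reviews)
# Shifting and positively rescaling one variable leaves r unchanged.
r_shifted = pearson_r(3.0 * price + 10.0, reviews)

print(r, r_shifted)
```

With only five rows the resulting value says little about the full dataset; the point is the formula and its invariance to location and scale.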
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
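A minimal sketch of the rank-based calculation described above, on synthetic data chosen to be monotonic but nonlinear. The helper names `rank_average` and `spearman_rho` are hypothetical, and ties are handled by average ranks, which is a common convention assumed here rather than taken from the report.

```python
import numpy as np

def rank_average(values):
    """Ranks starting at 1; tied values share the mean of their positions."""
    values = np.asarray(values, dtype=float)
    order = np.argsort(values, kind="mergesort")
    ranks = np.empty(len(values), dtype=float)
    ranks[order] = np.arange(1, len(values) + 1)
    for v in np.unique(values):
        tied = values == v
        ranks[tied] = ranks[tied].mean()
    return ranks

def spearman_rho(x, y):
    """Pearson correlation applied to the rank variables of x and y."""
    rx, ry = rank_average(x), rank_average(y)
    return np.cov(rx, ry, ddof=1)[0, 1] / (rx.std(ddof=1) * ry.std(ddof=1))

# Synthetic example: y grows monotonically with x, but not linearly.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = x ** 3

rho = spearman_rho(x, y)           # rank correlation
r = np.corrcoef(x, y)[0, 1]        # linear (Pearson) correlation for comparison

print(rho, r)
```

Because the ranks of `x` and `y` coincide, ρ reaches its maximum, while Pearson's r stays below 1 for the same data.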
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
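The pair-counting definition above can be sketched directly. The function name `kendall_tau_a` is hypothetical; it implements the plain formula as stated (often called tau-a), whereas library implementations commonly apply a tie adjustment, so results may differ when the data contain ties. The inputs are the `price` and `number_of_reviews` values of the first five listings under "First rows", which have no ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs, with no tie adjustment."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# First five listings from the "First rows" table.
price = [70, 95, 60, 65, 20]
reviews = [181, 395, 387, 53, 45]

tau = kendall_tau_a(price, reviews)
print(tau)  # 8 concordant and 2 discordant pairs out of 10
```

The quadratic loop over pairs is fine for illustration; for thousands of rows a library routine would be the practical choice.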
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | id | name | host_id | host_name | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2384 | Hyde Park - Walk to UChicago, 10 min to McCormick | 2613 | Rebecca | Hyde Park | 41.78790 | -87.58780 | Private room | 70 | 2 | 181 | 10/29/20 | 2.50 | 1 | 0 |
| 1 | 4505 | 394 Great Reviews. 127 y/o House. 40 yds to train. | 5775 | Craig & Kathleen | South Lawndale | 41.85495 | -87.69696 | Entire home/apt | 95 | 2 | 395 | 7/14/20 | 2.75 | 1 | 170 |
| 2 | 7126 | Tiny Studio Apartment 94 Walk Score | 17928 | Sarah | West Town | 41.90289 | -87.68182 | Entire home/apt | 60 | 2 | 387 | 11/16/20 | 2.77 | 1 | 0 |
| 3 | 9811 | Barbara's Hideaway - Old Town | 33004 | At Home Inn | Lincoln Park | 41.91769 | -87.63788 | Entire home/apt | 65 | 4 | 53 | 11/30/20 | 0.65 | 11 | 276 |
| 4 | 10610 | 3 Comforts of Cooperative Living | 2140 | Lois | Hyde Park | 41.79612 | -87.59261 | Private room | 20 | 1 | 45 | 9/15/20 | 0.60 | 2 | 0 |
| 5 | 10945 | The Biddle House (#1) | 33004 | At Home Inn | Lincoln Park | 41.91183 | -87.64000 | Entire home/apt | 116 | 4 | 21 | 11/21/20 | 0.26 | 11 | 83 |
| 6 | 12140 | Lincoln Park Guest House | 46734 | Sharon And Robert | Lincoln Park | 41.92335 | -87.64951 | Private room | 289 | 2 | 4 | 10/17/18 | 0.06 | 1 | 179 |
| 7 | 22362 | Luxury in Chicago! 2BR/ 2Ba / Parking / BBQ | 85811 | Craig | West Town | 41.89617 | -87.66041 | Entire home/apt | 99 | 91 | 9 | 10/12/14 | 0.11 | 2 | 365 |
| 8 | 24833 | Private Apt 1 Block to Fullerton L Red Line - Deck | 101521 | Red | Lincoln Park | 41.92679 | -87.65521 | Entire home/apt | 34 | 32 | 37 | 7/29/18 | 0.29 | 3 | 111 |
| 9 | 25879 | Top 2/1 Block to Fullerton L Red Line Deck & Yard | 101521 | Red | Lincoln Park | 41.92693 | -87.65753 | Entire home/apt | 94 | 32 | 47 | 5/11/20 | 0.37 | 3 | 152 |
Last rows
| | id | name | host_id | host_name | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6513 | 47115140 | Upscale & Cozy 2bd 1bt Apt Mins From Dwntwn Beach | 27619788 | Crystal | Woodlawn | 41.776940 | -87.608800 | Entire home/apt | 63 | 2 | 0 | NaN | NaN | 1 | 166 |
| 6514 | 47116894 | Height of Luxury! Airy & Elegant River North Home | 376283542 | Antoinette | Near North Side | 41.894940 | -87.635130 | Entire home/apt | 1599 | 1 | 0 | NaN | NaN | 1 | 353 |
| 6515 | 47118155 | New 3 bedroom 2 bathroom apartment in rogers park. | 379312368 | Rafat | West Ridge | 41.997890 | -87.688800 | Entire home/apt | 76 | 1 | 0 | NaN | NaN | 2 | 365 |
| 6516 | 47121422 | Upscale Southport Abode near Everything--2 bed | 6187354 | Erin | Lake View | 41.949880 | -87.667290 | Entire home/apt | 72 | 14 | 0 | NaN | NaN | 1 | 62 |
| 6517 | 47123944 | Chicago Loft | 342913217 | Kirsten | North Center | 41.937440 | -87.684830 | Entire home/apt | 1800 | 1 | 0 | NaN | NaN | 1 | 62 |
| 6518 | 47126307 | The Humboldt Jungalow | 4657251 | Nathan | Humboldt Park | 41.904030 | -87.716110 | Entire home/apt | 180 | 4 | 0 | NaN | NaN | 1 | 78 |
| 6519 | 47126361 | Lovely Flat Close to Rush Hospital / United Centre | 380761555 | Ekrem | Near West Side | 41.879120 | -87.681380 | Private room | 21 | 1 | 0 | NaN | NaN | 1 | 362 |
| 6520 | 47137445 | Entire apartment in West Town with Rooftop | 189403517 | Eugene | West Town | 41.900750 | -87.664920 | Entire home/apt | 115 | 10 | 0 | NaN | NaN | 1 | 14 |
| 6521 | 47140245 | Vintage 3BR hideaway near UChicago Med, Sanitized | 100179 | Kenneth | Woodlawn | 41.784033 | -87.610653 | Entire home/apt | 46 | 2 | 0 | NaN | NaN | 10 | 71 |
| 6522 | 47141177 | Modern Room in a Chicago Bungalow | 6617501 | Nnamdi | South Shore | 41.756159 | -87.585177 | Private room | 23 | 2 | 0 | NaN | NaN | 12 | 84 |